Simple Extensions of Combinatorial Structures 



Robert Brignall* 
Department of Mathematics 
University of Bristol 
Bristol, BS8 ITW 
England 

Nik Ruskuc 

Department of Mathematics and Statistics 
University of St Andrews 
St Andrews, KY16 9SS 
Scotland 

Vincent Vatter^ 
Department of Mathematics 
Dartmouth College 
Hanover, New Hampshire 03755 
U.S.A. 



An interval in a combinatorial structure R is a set / of points which 
are related to every point in i? \ J in the same way. A structure is 
simple if it has no proper intervals. Every combinatorial structure 
can be expressed as an inflation of a simple structure by struc- 
tures of smaller sizes — this is called the substitution (or mod- 
ular) decomposition. In this paper we prove several results of 
the following type: An arbitrary structure S of size n belonging 
to a class C can be embedded into a simple structure from C by 
adding at most f{n) elements. We prove such results when C 
is the class of all tournaments, graphs, permutations, posets, di- 
graphs, oriented graphs and general relational structures contain- 
ing a relation of arity greater than 2. The function f{n) in these 
cases is 2, [log2(« + l)l, \{n + l)/2], \{n + l)/2], [log4(n + l)l, 
[log3(n + 1)] and 1, respectively. In each case these bounds are 
best possible. 

* Supported by the Heilbronn Institute for Mathematical Research 

'''This research was conducted while V. Vatter was a member of the School of Mathematics and Statistics at 
the University of St Andrews, supported by EPSRC grant GR/S53503/01. 
Date: May 4, 2010 

Key words and phrases. 05C20, 06A06, 05C25, 05A05 (primary), 03C13 (secondary) 

AMS 2000 Subject Classification, indecomposable graph, modular decomposition, prime graph, simple 
permutation, simple poset, simple tournament, substitution decomposition 



1 



Simple Extensions of Combinatorial Structures 



2 



1. Introduction 

Relational structures - objects governed by a given set of relations over some ground set 
- provide a general setting in which to study a wide variety of combinatorial structures 
including graphs, tournaments, posets and permutations. For each of these types of struc- 
ture, the substitution decomposition (also called the modular decomposition) can be used to 
describe larger objects in terms of smaller ones. The elemental building blocks arising in 
this decomposition are the simple^ structures. 

In this paper, rather than decompose a combinatorial structure to describe it in terms 
of smaller simple ones, we will be embedding structures into larger simple ones. In par- 
ticular, we consider the following question: given a class C of relational structures, what is 
the minimal function f{n) so that any structure in C of size n can be embedded in a simple 
structure from C containing at most f{n) additional elements? 

This question has its origins in the early 1970s with Erdos et al. [8], who showed in the 
case of tournaments that /(n) = 2 for all n. In a subsequent paper [9] they also showed 
that the only tournaments which could not be embedded in a simple tournament with 1 
additional vertex were transitive tournaments with an odd number of vertices. The case 
of complete graphs was also answered around the same time in Sumner's thesis [23], who 
showed that /(n) = [log2(n + 1)] . In this paper, we answer the question completely for 
many standard combinatorial structures, in addition to more general relational structures 
containing relations of higher arity. These results rest on a unifying framework that uses 
the substitution decomposition, but each class of structures presents different problems 
and this is reflected in the variety of answers attained. 

For the rest of this section, we will introduce important definitions and present the 
standard framework for our approach to the problem. In Section 2 we show how the case 
of tournaments fits into the framework, then in Sections 3 — 6 we demonstrate the cases 
of graphs, permutations, posets, digraphs and oriented graphs. Section 7 discusses more 
general results for higher arity relational structures, and some concluding remarks are 
given in Section 8. 

A k-ary relation R on a set A is a subset of A^. Informally, a relational structure is an 
ordered sequence of relations over some set A. More specifically, we may define a relational 
language, C, to be a set of relational symbols, and for each symbol R there is a positive integer 
Hr denoting the arity of R. A relational structure A whose relational symbols are those of 
C is then defined by its ground set A = dom(^) and a set of subsets R"^ C A"^ for each 
R G C. Such a structure may also be called an C-structure. When it does not cause any 
confusion, we will omit the superscript A from relations. 

Most standard combinatorial structures can be viewed as relational structures; they 
differ in the number of relations involved, their arities, and extra properties they are re- 
quired to posses. This viewpoint offers a unifying framework for investigating different 
combinatorial structures 'in parallel'. 

Example 1.1. A graph G can be viewed as a relational structure with a single binary operation 
^The terms indecomposable, prime, and primitive have been used in place of simple in other contexts. 
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which is required to be symmetric and irreflexive. Thus, in this case the language is £ = {E, ue = 
2}, dom(^) = V(G), and {x,y) S E if and only if x ~ y in G. 

Example 1.2. Here are some further combinatorial structures viewed as relational structures: 

(i) Oriented graphs: L = {— = 2} where is asymmetric, i.e. (x — > y) => (y 7^ x). 

(ii) Tournaments: C = n^ = 2}, but where is trichotomous: for any x, y, precisely one 
of X = y, x ^ y or y ^ x is true. 

(Hi) Digraphs: C = {— >, n^ = 2}, a 'generic' structure with a single binary relation. 

(iv) Posets: £ = {<,n< = 2}, where < is asymmetric and transitive. 

(v) Linear orders: C = {<,n< = 2}, where < is asymmetric, transitive and trichotomous. 

(vi) Permutations: £ = {<, ^,n< = 2,n^ = 2}, where each of < and ~< are linear orders. For 
a permutation n of \n\ these two orders are defined as follows: < is the normal ordering on 
[n], while i ~< j if and only if7z{i) < 7z{j). 

A set I C A = dom(^) is an interval if for every R G £, all x,y E I, and all (mr — 1)- 
tuples (xi, . . . , x,_i, x,_|_i, . . . , x„j^) G \ Pr^'^ we have 

{Xi,..., Xi_i, X, Xi+i, Xn^ ) G -R-^ [xi,..., x,_i, y, x,-+i, ...,Xn^)eR^. 

Intuitively, in relationships involving elements outside an interval I, any two elements of 
I are interchangeable. Every singleton set {x} C A is an interval, as is all of A and the 
empty set. All other interval are said to be proper, and a structure is simple if it has no 
proper intervals. 

Let A and B be two structures over the same language £. We say that ^ is a substructure 
of B, or, equivalently, that B is an extension of ^, if A C B and R-^ = \a for every R G £. 
If, in addition, B is simple, we say that is a simple extension of A. Our aim is therefore 
to show that in a class of /^-structures C, for every A ^ C there exists a simple extension 
B ^ C of A, and more importantly, to bound | B \ A | as a function of | A | . 

To establish embeddings into simple /^-structures we typically need to look at the in- 
tervals of A. This is easier to establish by first defining the reverse process. Given an 
/2-structure S, we say that A is an inflation of S by the /^-structures {As : s G S}, denoted 
A = S[As ■ s G S], if ^ is obtained by replacing each element s G S with a set of elements 
As = dom(^s) (assuming beforehand, of course, that all these domains are disjoint) that 
form an interval in the A. More precisely, A = dom A^ = Uses ^s, and for every R E £, 
and every ng-tuple (fli, . . . , an^. ) G A where fl, G A,,, we have 



If A is an inflation of S, we also say that 5 is a quotient of A. We are particularly interested 
in the case where iS is simple — Theorem 1.4 below gives the imiqueness of such an S, 
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which will be called the simple quotient of A. This decomposition is the substitution (or 
modular) decomposition. 

The notion of modular decomposition dates back at least to a 1953 talk of Fraisse. How- 
ever, only the abstract of this talk [11] survives. The first article using modular decomposi- 
tions seems to be Gallai [12] (for an English translation, see [13]), who applied them to the 
study of transitive orientations of graphs. Since then it has reappeared in settings rang- 
ing from game theory to combinatorial optimisation - see Mohring [18] or Mohring and 
Radermacher [20] for extensive references. Before stating and proving the general unique- 
ness theorem, we quote two basic facts about intervals, which follow from elementary 
considerations. The first of these facts is proved explicitly by Foldes [10]. 

Proposition 1.3. For any two intervals I and J of the C-structure A: 

(a) If in J ^ (D then I n J and I Li J are intervals of A. 

(b) I\J is an interval of A. 

Theorem 1.4. Let A be an C-structure for some language C. There exists a unique simple £.- 
structure S such that A = S[As : s G S]. Moreover, when \S\ > 2, the structures As are also 
uniquely determined. 

Proof. Let M denote the set of all intervals, except A = dom(^), which are not contained 
in another proper interval. 

If two intervals 7, / G M intersect, then Proposition 1.3 (a) shows that 7 U / is also an 
interval, which, unless / U / = A, contradicts the definition of M. If / U / = A, then 
Proposition 1.3 (b) shows that / \ 7 is an interval, so A can be written as the inflation 
iS[^i,^2] of a two-element structure S (with S = {1,2}). Note that every two-element 
structure is obviously simple. 

For uniqueness in this case, suppose that A = T[A[ : f G T] is another modular 
decomposition of A. Let ti, G T be such that A\^ n Ai 7^ 0, A\^ n A2 7^ 0. Suppose first 
that |r| > 2, and let f G T \ {fi, ^2}- Clearly, at least one of the sets A[ n Ai or A[ n A2 
is non-empty; without loss of generality we assume that A[ fl Ai 7^ 0. Let U = {« G 
T : A[^r\Ai ^ (Z)}\ {12}. Then it is easy to see that U is an interval in T. But t, ti G U, 
contradicting the simplicity of T. Thus T = {fi,f2}/ and it is easy to check that T = S 
under the isomorphism 1 1— ?► fi, 2 1— > f2- 

Now consider the case where no two intervals in M intersect. Clearly, now the sets in 
M partition A. For each J G M choose a representative Xj G I, and define the >C-structure 
S to be the restriction of the structure A to the set S = dom(tS) = {xj : I G M}. Clearly A 
is the inflation of S by the corresponding intervals of A. The simplicity of S follows from 
the observation that if <S contained a proper interval K, then \J{I : Xj G K} would be a 
proper interval of A contradicting the definition of M. 

To show uniqueness of S and all in this case, suppose that we have another modular 
decomposition A = T[A\ : i G T]. Since the intervals in M are maximal and disjoint, each 
A[ is contained in some member of M. Suppose I ^ M contains several such intervals, I = 
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U . . . U . It is then easy to verify that {ti, . . . ,tj} is an interval inT, a contradiction. 

□ 

The two cases in the above proof will be referred to as the degenerate and non-degenerate 
substitution decompositions. The degenerate case can give rise to non-uniqueness of this 
decomposition, and this may be dealt with in several ways, either by specifying the prop- 
erties that one of the two substructures must possess (for example, the substructure must 
not itself be decomposable in the same way), or by decomposing the structure into as 
many pieces as possible (for example, dividing a disconnected graph into its connected 
components). 

We use the former treatment here, but note that the latter is often better suited for cer- 
tain problems, particularly in relation to the substitution decomposition tree of a structure A: 
each node of this tree corresponds to a substructure of A whose ground set is an interval, 
with the root of the tree being A and the leaves being the singleton ground sets. For a given 
node with corresponding non-singleton structure A' , the children of A' are the substruc- 
tures A's in the decomposition A! = S'[A's : s G dom(5)]. The complexity of computing 
these trees for particular relational structures has received considerable attention in recent 
years, particularly in the case of graphs because of its applications in optimisation. See, for 
example, [1, 4, 6, 15, 16]. 

The proofs of our main theorems below all follow the same basic pattern: we take an 
arbitrary structure A of the type under consideration, identify its substitution decomposi- 
tion, and then employ an inductive argument to show how adding extra points can be used 
to break the intervals of A. The degenerate case usually needs to be considered separately, 
and it is precisely this case that normally gives our upper bounds. 

2. Tournaments 

Recall that a tournament is an oriented complete graph. An interval in a tournament (often 
called a convex set in the literature) is a set I of vertices such that for every pair x,y a I and 
any z G V{T) \ I, either x ^ z and y — )• z or z — ?• x and z ^ y. 

The name "tournament" derives from its use to describe a competition where every 
pair of players x, y must meet each other in a match, the outcome being either that x wins, 
y — )■ X, or X loses, x — >^ y. This viewpoint then defines an algebra with two idempotent 
binary operations Aj = {T, V, A), so that if x ^ y, then xVy = yVx = x and x Ay = y A 
X = y. An extensive survey of tournaments from this algebraic viewpoint has been written 
by Crvenkovic, Dolinka and Markovic [5]. Of interest for us here is that a tournament is 
simple if and only if its corresponding abstract algebra is also simple, (i.e. it has no proper 
congruences or, equivalently, homomorphic images). Simple extensions of tournaments 
were studied in a string of papers in the early 1970s, and in particular it is known that at 
most two additional vertices are needed: 

Theorem 2.1 (Erdos, Fried, Hajnal and Milner [8]). Every tournament on n vertices has a 
simple extension with at most 2 additional vertices. 
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Proposition 2.2 (Erdos, Hajnal and Milner [9]). A tournament T has a one-point simple exten- 
sion if and only ifT is not finite odd chain with 5 or more vertices. 

Note that these results hold for tournaments of arbitrary cardinality, though they were 
originally proved for finite tournaments by Moon [21]. We give here another proof of the 
finite case using the substitution decomposition, in order to introduce the methods we 
will use in the following sections, and to be able to compare those later results with the 
tournament case. 

Proof. We proceed by induction on |r| = n > 2 by constructing two one-point extensions, 
and T^, satisfying: 

• All proper intervals of = T U {x} and = T U {y} must contain x and y, respec- 
tively. 

• T^ = TU{x}, constructed as described below, is simple unless n = 3 or T is a finite 
odd chain. 

• T-^ = T U {y} is derived from by the rule y — )• y if and only if x ^ w for each 

V G V{T). 

• If n = 3 or T is a finite odd chain, the extension T^^, with edgeset E{T^) U E{T'^) U 
{x y}, is a two-point simple extension. 

The base case is n = 2, for which we may assume T = {u,v} with m — > We define x 
by y — > X — > w, and note that is simple, while has proper intervals {u,y} and {v,y}, 
both of which contain y. 

Now suppose n > 3, and consider the substitution decomposition T = S[As : s G S] 
where the simple quotient S of T has m > 2 vertices. Suppose first that m > 2, in which 
case the tournaments As are defined uniquely, we construct as follows. First, if every 
As contains a single vertex then T = S is simple. Unless |r| = 3 (a case which will be 
covered separately later), the addition of x will preserve simplicity providing it does not 
have the same neighbourhoods as any existing vertex of T and it does not satisfy x — > T or 
T — )■ X. Note that there are 2" — n — 2 ways of choosing such an x. Moreover, since T was 
simple the addition of the vertex y to form can introduce proper intervals only of the 
form {t,y} for t ^ T (note that V{T) cannot form an interval in as then it would also 
have formed an interval in T^, a contradiction). 

Thus we may assume that at least one interval As contains more than one vertex. In 
this case, we add a vertex x which we connect to each nonsingleton interval As inductively, 
forming A]. Now fix any nonsingleton interval As*, and for each singleton interval As form 
As to satisfy x ^ As if and only if As As* . 

Claim. The one-point extension ofT is simple. 
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Figure 1: One-vertex (non-simple) extensions of tournaments on 3 vertices. 

Proof. Note first that by the inductive construction we cannot have x — )■ T or T — )■ x in T^, 
otherwise the extension A\ oi the nonsingleton block Ag would contradict the inductive 
hypothesis. Now suppose 7 is a non-singleton interval of T\ and let u,v G 1 be two 
distinct vertices. The possible positions of u and v vnT^ give rise to three cases. 

Case 1: u G As, v G Ag/ for some s, s' G S with s ^ s' . Then by the simplicity of the 
quotient S we have V{T) C J, and x is then included since we cannot have x ^ T or 
T ^ X. 

Case 2: u G As, v = x, for some s G S. If there exists some other nonsingleton block As' 
of T, then we have either u — > As' or As' — )■ u, but not v — > As' or Ag/ — > v, reducing to 
Case 1. If every other As' is a singleton, then A, is the only nonsingleton block so s = s*, 
and, by the construction, u and v will disagree on every singleton block, again reducing to 
Case 1. 

Case 3: u, v G As for some s G S. By induction, the only possible proper intervals of A] 
are those which involve x, reducing immediately to Case 2. □ 

Consequently, the one-point extension T^, if not itself simple, contains proper intervals 
each of which must contain y, as any other interval would also be an interval in . 

This leaves the case where \T\ = 3 or T = S[Ai, A2] with |S| = 2, Ai ^ A2, and Ai 
and A2 not necessarily uniquely defined. Note that there are no simple tournaments on 
four vertices, and so if |T| = 3 a simple extension will require at least two points. For the 
induction, however, we must create for which every proper interval must include the 
additional vertex, and this is done in Figure 1. Note then that the two- vertex extensions 
T^^ are simple in both cases. 

Thus we may now assume that T has at least four vertices, and T may be written 
(possibly not uniquely) as T = S[Ai, A2]. If T is a finite odd chain, then we may pick 
A2 to be a finite odd chain and Ai a finite even chain. Denote by a the vertex of Ai for 
which Ai \ {fl} — )■ a, and by b the vertex of A2 for which b — ?► A2 \ {&}. We adjoin a 
vertex x to T so that A2 U {x} forms whichever of A\ or A2 ensures x — )■ b, and Aj U {x} 
forms the simple one-point extension A\. The resulting extension is not simple, but we 
note that any proper interval must contain the vertex x, as the only other candidate we 
need to check is {a, b}, which, since we fixed x — )■ cannot be an interval unless x — > a, 
contradicting the simplicity of A] . We observe also that must satisfy the same property, 
and consequently the two-point simple extension T^^ is simple. 

Assuming herein that T is not a finite odd chain, consider first the case where the de- 
composition T = S[Ai,A2] satisfies IA2I = 1. Since Ai (which contains at least three 
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Figure 2: A 2-vertex simple extension of a transitive tournament on 7 vertices. 

vertices) cannot be a chain, there is no a G Ai for which Ai \ {a} — > a. We construct 
= T U {x} so that A2 ^ x and so that = U {x} is, by induction, a simple ex- 
tension of A-[. It is easy to check that is simple, and then can contain only proper 
intervals involving x. A similar argument applies when \Ai \ = 1. 

Finally, if neither = 1 nor IA2I = 1 then by induction we may adjoin a single 
vertex x for which Aj U {x} corresponds to the simple extension A\ and A2 U {x} to A\. 
We claim that the tournament T U {x} gives us T^. Since A| and A2 are simple one-point 
extensions, we need only consider intervals of the form {a,b} with a G Ai and b E A2. 
Since a — > A2, this would imply — > A2 \ {b} from which we can conclude that x — > as 
A2 is simple. Similarly, A\ ^ h implies A\ \ {a} — > a and hence (by the simplicity of A|) 
fl — > X, so fl and & disagree on x, completing the proof. □ 

It is initially surprising that the case where T is a finite odd chain with 5 or more ver- 
tices requires two additional vertices rather than one. Figure 2 shows a two-point simple 
extension of a chain on 7 vertices. 

3. Graphs 

In a graph G, an interval 7 is a set of vertices for which N(m) \ I = N{v) \ I for every 
u, u G 1. A graph is simple if it contains no proper intervals, but it is worth noting that the 
word "simple" is commonly used to describe graphs with no multiple edges or loops, and 
our notion is more often referred to as prime or indecomposable. These graphs have been 
the subject of considerable study, see for example Ehrenfeucht, Harju, and Rozenberg [7], 
lUe [14], and Sabidussi [22]. A survey of indecomposability and the substitution decompo- 
sition in graphs can be found in Brandstadt, Le, and Spinrad's text [2]. Simple extensions 
of graphs have received some attention in the past. In particular, we have: 

Lemma 3.1 (Sumner [23, Theorem 2.45]). The complete graph K„ has a simple extension with 
[log2(n + 1)] additional vertices. 

The bound m = [log2(« + 1)] is also the smallest possible, for were we to add a set B 
of m vertices with n > 2"' — 1, then either two vertices in G have the same neighbourhood 
in G U B, or one vertex of G is connected to every other vertex in G U B, both of which give 
an interval. 

Theorem 3.2. Every graph G has a simple extension with at most m = [log2(|F(G)| + 1)] 
additional vertices. 
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Figure 3: Two point simple extensions in the cases when |G| =2. In each case the 
filled nodes correspond to G, and the empty nodes to B. 

Proof. First note that when |G| = 1, we may trivially extend by 1 vertex to form K2. We 
will prove, by induction on | G | = n > 2 the following statement: G has a simple extension 
with a set B of m independent vertices^, where 2 < m < [log2(n + 1)] • 

The base case n = 2 may be seen in Figure 3, so now suppose n > 3. Write G = 
H[Jy : V G V{H)] where H is the simple quotient of G, and suppose first that |H| > 2. 
If H = G then the graph is already simple, but for the induction we need to add two 
vertices. As in the tournament case, we may preserve simplicity by adding vertices whose 
neighbourhoods in G are distinct and neither the same as any vertex in G, empty, nor all 
of y(G). As in the case of tournaments, are 2" — n — 2 possible neighbourhoods, and for 
n > 4 we may pick any two of these for the new vertices. 

Thus we may now suppose that G 7^ H. By induction, each ]-o of size \]v\ > 2 can be 
extended by some independent set By of size at most [log2 (|y(/i,)| + l)]so that ]-o U By is 
simple. Picking x G V{H) for which = B is largest, for every v ^ xirvH with \ Jy\ > 2 
we identify Bj, with any subset of B of the correct size. For any v E H for which \Jy\ = 1, 
we cormect J-o to any vertex in B that is not adjacent to all of Jx, and label this singleton set 

Claim. The extension G Li B is simple. 

Proof. Consider an interval I with at least two vertices a and b. There are four cases: 

Case 1: a G ]u, h G ]-o for w 7^ u in H. The simplicity of H implies that G C J. In 
particular this gives ]x ^ I, which implies B C. I since U B is simple. 

Case 2: a,b e JyUBy,v ^ x. By the simplicity of Jz, U Bj, we have Jz, U B^, C L If | > 2 
then by induction \By\ > 2 implying that Jx Q I, reducing to Case 1. If, say, = {a} then 
either a is adjacent to everything in /;,; or nothing in Jx, while by the construction b G B-o is 
adjacent to some but not all of Jx, reducing to Case 1. 

Case 3: a,b E JxLi B. We immediately have /x U B C I, and so if there is some v ^ x for 
which \ Jy \ > 2 then Jy C I gives Case 1, so we now assume | = 1 for all u 7^ x in H. Pick 
any such Jy = {c} we know there exist bi,b2 G B with &i ~ c but ^2 7^ c, implying c G /, 
which again reduces to Case 1. 

Case 4: a G Jy, b ^ B\By. Since Jy U By is simple, we know that a is adjacent to some 
c G /i, U By, while b is not, reducing to Case 2. □ 

For the degenerate cases, let us first assume that H = K2, i.e. that G is disconnected, so 
G = H[/i,/2] where /i and J2 may be picked possibly in a number of ways. We arrange it 

^The assumption that B is independent may be replaced by the requirement that B be any graph specified 
in advance, but this adds considerably to the complexity and length of the proof. 
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so that /i is a largest connected component, and suppose first that | /i | > 2. By induction, 
each of /i and }2 may be extended by adding a set of vertices Bi and B2, respectively, unless 
1/21 = 1, where we pick any vertex of B to act as the singleton set 82- Fix x G {1, 2} so that 
B = Bx is the larger of Bi and B2, and associate the other set B^', x' G {1,2}, x' ^ x, with 
any subset of B. If |/i | = 1 then G is in fact K„, and but for the induction we could appeal 
to Lemma 3.1. By induction, there exists a set B2 of [log2 m] vertices for which J2 U B2 is a 
simple extension of J2- Unless [log2(n + 1)] = [log2n] + 1, we may assign a nonempty 
neighbourhood of B2 to the single vertex of /i that is different to the neighbourhood of 
every vertex in J2. In the exceptional case where [log2(« + 1)] = [log2 n] + 1, we assign 
any nonempty neighbourhood of B2 to /i, observing that there exists a vertex v ^ J2 with 
the same neighbourhood. However, we are now permitted to add another new vertex b, 
attached to /i and nothing else, so that v no longer has the same neighbourhood as the 
single vertex in /i . 

In the case that H = K2, only a few modifications need to be made to the above. We 
first pick Ji so that /i is the largest possible connected component. When /i is not a sin- 
gleton but J2 is, we select B2 = {fo} by choosing any b G B for which b is not connected 
to all of /i (as in the non-degenerate case). When /j is a singleton (so G is a complete 
graph), it can have any non-complete neighbourhood of B = B2 (including the empty 
neighbourhood), providing no vertex of J2 already has that neighbourhood. If no such 
neighbourhood exists, then we may add a new vertex to B and connect it to /i . The analy- 
sis of these degenerate cases is similar to and easier than the analysis in the nondegenerate 
case above. □ 

Although this bound is tight for complete and independent graphs, the inductive con- 
struction used in the proof actually does somewhat better in many cases. Unless the graph 
is complete or empty, at each stage of the induction we add only as many vertices as are 
required by any single block f-o of the substitution decomposition. This may be thought 
of iteratively in terms of the substitution decomposition tree: recalling that the degenerate 
cases are handled in this tree by breaking up the graph into as many parts as possible, 
the largest such degenerate node all of whose children are leaves will dictate how many 
additional vertices are needed. If no such degenerate node exists, then the inductive con- 
struction above requires only two additional vertices. 

4. Permutations 

In a permutation tt of [n] an interval is a collection of entries that are contiguous both 
by position and value. If a permutation is viewed as a set with two linear orders < and 
-<, an interval is a set of points that is contiguous under both < and -<. Intervals are 
easily identified in the plot of a permutation as sets of points enclosed in an axis-parallel 
rectangle, with no points lying in the regions above, below, to the left or to the right (see 
Figure 4). To embed a given tt in a simple permutation, therefore, we must ensure that 
every axis-parallel rectangle containing at least two points of n is "broken" by at least one 
of the extension points. 
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Figure 4: An interval in the plot of a permutation. 



Theorem 4.1. Every permutation rconn symbols has a simple extension with at most \{n + 1) /2] 
additional points. 

Proof. We proceed by induction on n > 2, claiming that for each permutation n of length 
n we may construct two extensions n'^ and both containing at most f new points, 
satisfying: 

• Viewed as extensions, each tt" (a G {t/J-}) has a new leftmost point which is neither 
a new maximum nor a new minimum, called the entry point and denoted In(7r''). 

• Each tt" (fl G {t/i}) has a new exit point Out(7r''); for tt^ this is a new maximum 
while for this is a new minimum, and in both cases it is neither a leftmost nor a 
rightmost point. 

• Any proper interval of tt" contains the exit point Out (tt"), but does not contain the 
rightmost point of n. 

• At least one of tt^ and is simple. 

Let us call the permutation obtained from zr" by removing the entry and exit points the 
core of tt" and denote it by Core (tt"). Note that Core (tt") still contains the original copy of 

71. 

In the base case n = 2, either tt = 12 or tt = 21. When n — 12, zr^ = 2413 is simple, 
and the only non-singleton intervals of = 3124 are 12 and 312, both of which contain 
the exit point. The case tt = 21 is dealt with by symmetry. 

Now suppose n > 3. Write n = (r[TC\, 712, ... , TCm] where the simple quotient cr of 7r 
is of length m > 2, and rci, 7Z2, . . .,7Zm are permutations. First suppose m > 2 so that the 
substitution decomposition is unique. If tt/ = 1 for all /, then 7r = C7 is already simple. 
We construct tt^^ and by adding precisely two points. The first is a new leftmost point, 
which may be positioned vertically anywhere except as a new maximum, minimum, or 
adjacent to the leftmost entry of tt. The new maximum or minimum is inserted similarly, 
and it is easily checked that the resulting permutation is simple. 

Now suppose that at least one 71; contains at least two points. The permutations 71^ 
and 71^ will be constructed by extending each non-trivial block rCi of re to one of rcj or nf 
and then 'repositioning' the entry and exit points so that they 'link' successive blocks. The 
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Figure 5: On the left, the substitution decomposition of tt = 264135 [tti, 1, 7T3, 7T4, 1, n^]. 
On the right, the extension tt^ formed from the blocks r" connected by linking points 
^1,^2 and £3. 



choice between tt- and rij is made so as to enable this linking. More formally, to define tt" 
(fl G {t, i}), first write a = Si . . . Sm, and suppose that the non-trivial blocks of tt are, from 
left to right, Tij^, . . . , n^. Next, for each / = 1, . . . , m define 

Core ( 71;' ) if f = z,-, 1 < r < t — 1, and Sj,. < S/^^j 
Core(7r-') if i = U, < r < t — 1, and Si^ > Sj^^j 
Core(7r-') if i = U 
1 otherwise, 

and form the permutation t = cr[T^, . . . , t„,] . For each r = 1, . . . , f — 1 insert a linking point 
£r, the position of which is uniquely determined by the requirement that £,■ is the exit point 
for Tjj, and the entry point for t,^^j . Finally, complete the construction of n" by defining 
io = Ini^n") to be the leftmost point, which at the same time serves as the entry point to 
T,j, while it = Out(7r'') is a new minimum or a new maximum (depending on a) which is 
at the same time the exit point for Tj^. This construction is illustrated in Figure 5. 

Claim. If Tz^ is not simple then the rightmost non-singleton block is the maximal block by 
value, and every non-trivial proper interval of contains Out(7r^). 

Proof. Let 7 be a non-singleton interval of n\ and let u,v G I he two distinct points. We 
consider several cases, depending on the position of u and v irvn^ . 

Case 1: u G Ty, v G Tj., ; 7^ k. The simplicity of a implies that I contains all t, (1 < z < m). 
Each of the remaining points £o/ ^1/ • • • / overlaps at least one T,, and so these points must 
belong to I too. We conclude that I = tz^. 

Case 2: u,v E t,^ for some r G {1, . . . , f — 1}. The set I n (t,^ U £,--1 U ir) is an interval of 
the permutation tt-' = t,^ U £^-1 U (& G {tz-l})/ and so by induction it contains the exit 
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point £r- But ir overlaps horizontally with Tj^^j, and so I must contain a point from 
Thus we have reduced this case to Case 1. 

Case 3: u,v a t„ . As in Case 2 we see that I must contain the corresponding exit point, 
which this time is Out ( 71^). Now, if is the maximal block by value there is nothing 
further to prove, while if is not maximal we see that I must contain a point from some 
other block, and reduce again to Case 1. 

Case 4: u e {^O/^i/ • • ■ ,£t}, v e Xj. From the construction it follows that u and v are 
separated either horizontally or vertically by at least one other point belonging to some r^. 
Thus this case reduces to one of the first three cases. 

Case 5: u,v E {£o> h, - ■ ■ f^t}- Again, we can see that I must contain at least one element 
from some Tj, reducing to Case 4. □ 

Returning to the proof of the theorem, we note that the analogous assertion holds for 
rc^: If TZ^ is not simple then the rightmost non-singleton block tt,, is the minimal block 
by value, and every non-trivial proper interval of contains Out(7r^). Since a is simple 
and of length at least 4, neither the maximal nor the minimal blocks by value can be the 
rightmost block of n, and so the rightmost point of n cannot appear in any proper interval 
of tt". Moreover, as cannot be at the same time maximal and minimal, we conclude that 
at least one of n'^ or n-^ is simple. 

To complete the proof in the non-degenerate case, let us estimate the number of points 
we have added to zr. To each non-trivial block tt,-^. (1 < r < t) we have added at most 
[(ItTjJ + 1)/2] points. Moreover, we merged t — 1 pairs of exit/entry points. Thus the 
total number of additional points does not exceed 

r=l ^ r=\ ^ r=l ^ ^ 

Since the number of points is a whole number, we conclude that it does not exceed \{n + l)/l\, 
as required. 

In the case where m = 2 we may assume without loss that a = 12, and write tt = 
11[tci, tc2\, where tc\ and 712 may be chosen possibly in a number of different ways. Fix 
TC\ so that if possible it does not end with its largest element. If this is possible, and if 
712 is not a singleton, we may construct tt" (a G {t, ^1}) by extending tti to ti\ and 712 to 
712, ^^'^ identifying the exit point of the first to the entry point of the second. If 712 is a 
singleton, then the exit point of 7i\ may be placed above TC2 to form 71^ without producing 
a proper interval between the top of n\ and tcj: the intersection of any such interval with 
n\ would have to form a proper interval in 7i\ containing the rightmost point of tci, which 
is impossible by the inductive hypothesis. We form 71^ simply by taking 12[7rj', 712], with 
the exit point of tti acting as the exit point of 7i. Note that tc^ is not simple since 7r| is an 
interval, but it contains Out (71-'-) and not the single point of K2 and so satisfies the inductive 
hjrpothesis. 
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Figure 6: Simple extensions of 12 ■ ■ ■ n, for n = 12 (left) and n = 13 (right). 



If it is not possible to pick tti so that it does not end with its largest element, then we 
first attempt to pick tz2 so that it does not start with its smallest element. If this is possible, 
and if tti is not a singleton, an analogous argument to that given above applies. On the 
other hand, if tti is a singleton, place all additional points of n!^ that lie to the left of every 
point in 712 to the left of the singleton tti . (This happens, for example, if tt = 132, and we 
form Tz'^ = 35142 and = 41253.) If it is not possible to pick TI2 so that it does not start 
with its smallest element, then tt = 12 •■• n. If n > 3, we set ttj = 12 • ■ • n — 2 so that 
712 forms the pattern 12, whence there is a two-point extension 7T2, a G {f, J,}, in which 
both the entry and exit points occur to the left of Core (712). This permits us to connect the 
exit point of the extension 7r| to the entry point of 712, and the exit point of both acts 
as the exit point of n" and ensures that there are no intervals between and 712. Finally, 
when Tz = 123, we set 7r^ = 31524 and = 42135. In each case, checking that the so 
constructed extensions 7r^ and 7Z^ satisfy the required conditions is similar to (and easier 
than) the non-degenerate case. □ 

See Figure 6 for examples of extensions of monotone permutations. Note that in this 
case the bound is tight: every adjacent pair i, / + 1 must be "separated" either horizontally 
or vertically by one of the additional points, and the points 7r(l) = 1 and 7r(n) = n of 7r 
must not lie in the "corners" of the simple extension — a total of n -|- 1 gaps to be filled, 
and each additional point can be used to fill at most two of these gaps (one horizontally, 
one vertically). 

5. POSETS 

Although posets are very naturally described as relational structures, we have delayed 
considering their simple extensions until now, as with posets we encounter a "mix" of 
the results related to the degenerate decompositions corresponding to permutations and 
graphs. For permutations recall that the degenerate cases correspond to the increasing 
and decreasing permutations, which (viewing them as relational structures) occur when 
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the two linear orders are equal or are the opposites of each other. For graphs, the non- 
uniqueness comes in the form of complete and independent graphs. Posets can be decom- 
posed non-uniquely either through total comparability (linear orders) or total incompara- 
bility (antichains), and the simple extension in each case is significantly different. 

We begin with a result that holds for all posets; we will discuss special cases where 
fewer additional elements are required afterwards. Our approach takes much the same 
form as the permutation case, inductively identifying entry and exit points from the simple 
extensions of the intervals in the substitution decomposition, and this gives rise to the same 
bound. Because of this similarity, we will omit some of the case by case details in the proof. 

Theorem 5.1. A poset (P, <) on n elements has a simple extension with at most \{n + 1)/2] 
additional elements. 

Proof. Extending a poset of size 1 is trivial, so we proceed by induction on n assuming 
n > 2. Our claim is that we may form four extensions P^, P^, P^-^ and P^ of a poset (P, <), 
satisfying: 

• Each extension P" {a G {t, I, tl, It}) has at most \{n + 1) /2] new elements. 

• Among these new points lie two distinguished elements, Exti (P") and Ext2(P''). For P^ 
these are both new minima, for P^ new maxima, for P^^, the point Exti (P^^) is a max- 
imum and Ext2(P^^) is a minimum, while for P^, the point Exti(P-''^) is a minimum 
and Ext2(PJ-'^) is a maximum. Note that P^ and P^ need not be different extensions 
— it is only the labelling of the distinguished elements that matters. The remaining 
points of P" will be called the core, denoted Core(P''). 

• Every proper interval of P" {a G {t/ 1/ ti/ It}) contains Ext2(P''). Additionally, in P^ 
and P'^ there is a minimal element that is not contained in any proper interval, and 
in P^ and P^ there is a maximal element not contained in any proper interval. 

• At least one P" is simple. 

The base case is n = 2, when the poset is either linear or an antichain. The extensions P" 
may be seen in Figure 7, and it is straightforward to check that these satisfy the inductive 
hypothesis. Now suppose that n > 2 and decompose the poset as P = S[As : s G S]. 
Assuming first that \S\ > 2, we proceed in essentially the same way as in the permutation 
case. 

If every Ag is a singleton, then P is already simple, but for the purposes of the induction 
we must add two elements to form the four extensions P^, P^, P^^ and P^. First observe 
that P can contain neither a unique maximum nor a unique minimum element. To form P^ 
we find any two maxima of P, place Exti {P^) above one of them, and place Ext2(P^) above 
both. We form P^ analogously, and P^ and P^ are formed by adjoining two elements to 
any chosen element of P; see Figure 8. Note that all four extensions are necessarily simple. 

When at least one As has more than one element, induction may be used on each such 
interval to form the extensions Af (flg G {t/t/ tt/4t})/ which we utilise to construct P". 
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Figure 8: The four 2-element simple extension of an arbitrary simple poset. The points 
labelled 1 and 2 correspond to the distinguished elements Exti(P") and Ext2(P''), re- 
spectively. 
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The basic idea is to consider the non-trivial blocks in turn, and for each of them determine 
which extension A"' to use so that A"" can share one distinguished point with the previous 
block, and one with the subsequent block. The first and the last blocks will contribute one 
distinguished point each to become the distinguished points of P". To be more precise, 
assume (without loss of generality) that the groundset of S is {1, . . . , |S | }, and that the first 
t of these elements correspond to the non-trivial blocks of P. Define elements Ug e 
/ it}/ 1 < s < as follows: 



fls = < 



t iffl e {t,t;}andl ;^is2 

i if fl =1 and 1 >s 2 

t; iffle{t,t;}andl>s2 

It otherwise 

t if s - 1 s and s ;2^s s + 1 

I if s — 1 <s s and s >$ s + 1 

% if s — 1 s and s >s s — 1 

It otherwise 

t ift-1 i and a =t 

i ift-1 <st and a G {t, tt } 

tt if t -1 ^st anda e {lAl} 

4t otherwise. 



if s = 1, 



if 2 < s < i - 1, or 



if s = t. 



Fori < s < f, we define new, extended blocks Bs = Core(As''). For the remaining elements 
of S, i + 1 < s < |S|, As is the trivial poset on one element, and we set Bg = Ag. 

Now we form the extension S [Bg : s G S] by adding t + 1 linking points 4 (s = 0, . . . , t), 
as follows: Iq is related to the elements of Bi in exactly the same way as Exti(A^^) is 
related to the elements of Bi = Core(A!^^); £s {1 < s < t — 1) is related to the elements 
of Bg in the same way as Ext2(As'') is related to the elements of Bg = Core(As'), and 4 is 
related to the elements of Bj+i in the same way as Exti{A''g_^l) is related to the elements of 
Bs+i = Core(A"^-j'); it is related to the elements of Bf in exactly the same way as Ext2(Aj') 
is related to the elements of Bf = Core(Aj'); these comparisons and their consequences 
via transitivity are the only comparisons that the elements ig satisfy. Finally, we stipulate 
Exti(P«) =4,Ext2(P«) =£t. 

To see that one such P" {a G {t, t, tt/ tt}) is simple, a case analysis needs to be carried 
out, but as this is similar to the permutation case, we omit the details. The only intervals 
that can arise in any such P'^ must contain it and be contained in Bf U {it}, and these 
intervals are permitted by the inductive hjrpothesis for all but one of the extensions. To 
see that there is indeed one extension with no such interval, however, note that by the 
simplicity of S, Bf must be related to some other block Bg of P, and by symmetry suppose 
Bg < Bf. Taking it to be a new minimum (i.e. the extension P" that we are considering 
satisfies a G {J., tt})/ we note that any interval J in Bf U {^f} cannot be an interval in P", 
since there is some x G / n Bf with x > Bg, while it ^ Bg. 

In the degenerate case we have P — S[Ai, A2], where S is the 2-element chain (so 
Ai < A2) or the 2-element antichain (so that Ai and A2 are incomparable). Supposing 
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first that S is a chain, if possible pick Ai so that it has no unique maximum. Assum- 
ing this is possible and that the resulting Ai is not a singleton, identify the distinguished 
points of the extensions and A^^ in the same way as the non-degenerate case, with 
Exti(P'') ^ Exti(A^i) andExt2(P'') = Ext2(A2'). On the other hand, if the resulting A2 is a 
singleton, we form S [Core (A^), A2] and then add the distinguished points to A\ to serve as 
the distinguished points of P". (Note that by the requirement that A contained a maximal 
element that is not contained in any proper interval and that Ext2(P~'^) was contained in 
every proper interval, there are no proper intervals in P-^^.) 

If we cannot pick A\ to have no unique maximum but we can instead pick A2 to have 
no unique minimum, then we may argue by symmetry. This leaves only the case where 
P is a chain, which is handled by appealing directly to the permutation case; the chain P 
corresponds to an increasing permutation n = under the following mapping: z < / in P 
if and only if both / < / and 7r(z) < rc{j). Under this mapping the simple extensions of 
increasing permutations map to simple extensions of linear orders, and by appealing to 
the inductive construction of Theorem 4.1 this allows us to form P^ and P^, one of which 
is simple. We form P^ and by using an analogous mapping that connects chains to 
decreasing permutations. 

This leaves the case where S is a 2-element antichain. By analogy with the case where 
S is a two-element chain, first attempt to pick Ai so that it is not itself decomposable by 
an antichain and, if possible, so that Ai is not a singleton. If this can be done leaving A2 
with I A2 1 > 2, then we proceed by using the same construction as the nondegenerate case, 
while if A2 is a singleton then we form P^ and P^ simply by taking the corresponding 
extensions of Ai and leaving the single element of A2 unrelated to any other elements. We 
ensure that P^ and P^ are simple extensions of P by adding the relation Ext2(Ai) > A2 in 
P^ and Ext2(Ai) < A2 in P^. This leaves only the case where P is an antichain, which is 
covered by appealing to the connection between posets and permutations. □ 

As we may expect by its similarity to the permutation case, precisely [(n + l)/2] ad- 
ditional elements are required when (P, <) is a linear order. Note further that this bound 
is tight, because of its connection with permutations. 

On the other hand, when (P, <) is an antichain, a simple extension does not require 
this many points by the following connection with graphs: the comparability graph G(P, <) 
of a poset (P, <) is a graph with vertex set P, and edges p ^ cj if and only if either p < cj 
or q < p? It is then easy to appeal to Theorem 3.2 to create simple extensions of antichains 
that require just [log2(n + 1)] additional points, and again this bound is tight. In fact, if 
the simple extension of a graph admits a transitive orientation then this extension could 
be interpreted as a simple extension of a poset. 

^The converse operation — forming a poset from a graph — is called a transitive orientation of (the edges 
of) G. This connection between posets and graphs arises in a number of combinatorial problems — see 
Mohring [19] for a survey. 
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6. Digraphs and Oriented Graphs 

An arbitrary digraph corresponds to the most general type of binary relational structure, 
i.e. the structure A contains just one binary relation R for which there are no restrictions. 
Elements of the relation may be viewed as directed edges on a graph, so that two vertices 
u and y of a digraph may be related in one of four different ways: m — > t;, u z;, m o v, or 
u and V have no edge between them. 

We obtain a general bound on the number of additional vertices required by analogy 
with the graph case: where we had two possible "types" of connection (i.e. an edge or no 
edge) between an old and a new vertex for graphs, we now have four. The result of this is 
that a new vertex in a digraph can do the work of two new vertices in a graph, so half as 
many vertices are required. 

Theorem 6.1. A digraph D has a simple extension with at most m = [log4(|y(D) | + 1)] addi- 
tional vertices. 

Proof. The underlying graph Go of a digraph D is the graph formed by replacing each 
directed edge with an undirected one, i.e. uv G E(G) if and only if m — )■ f or y — >^ w. 
Moreover, it is easy to see that a set of vertices that forms an interval in D will also form 
an interval in Go- By Theorem 3.2, we can construct a simple extension G^ of Gd by 
adding a set B = {bi,b2,. . . ,hm} of m < [log2(|V(GD)| + 1)] vertices. Now, noting that 
2[log4(|y(D)|+l)] > [log2(|y(GD)|) + ll,weaddasetC = {ci,...,caof£ = \m/2] 
additional vertices to Dg, satisfying, for every v G V{D) and i = 1,2, . . . ,£, 

V — > Ci if and only if vb2i-i G £(G5), and, 
Ci V if and only if vb2i G E{G^) and 2i < £. 

Thus every vertex c, of C has outneighbourhood matching the neighbourhood of fc2!-i/ 
and inneighbourhood matching b2i. It is now straightforward to check that this is a simple 
extension. □ 

This bound can be seen to be tight for a complete digraph — a digraph for which w o y 
for every pair of distinct vertices u and v — and an empty digraph, again by analogy with 
the graph case. 

However, if our digraph forms a linear order (and is hence transitive), our structure 
forms a transitive tournament. Appealing to Theorem 2.1, only one additional point is re- 
quired to form a simple tournament unless the tournament had an odd number of vertices, 
when two are required. Here, however, we do not require the resulting simple extension to 
be a tournament, and this added freedom allows us to form a simple one-point extension 
in every case. See Figure 9 for an example. 

Extending the cormection with tournaments, any digraph that is also a tournament 
must consequently have a simple one-vertex extension. A natural question then arises: 
which digraphs have simple extensions requiring only one additional vertex? Clearly 
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structures which are "close" to tournaments will have this property, and again the an- 
swer must lie in the substitution decomposition tree, and in particular how we decompose 
a structure with a quotient of size 2. 

Oriented Graphs An oriented graph G is a graph in which each edge has a specified direc- 
tion, and it can be described as a relational structure Aq on the language £ = {— >,n^=2} 
where — > is an asymmetric binary relation, i.e. for each m, w G dom(^G) = V{G), u ^ v 
implies v -/^ u. The difference between these and digraphs is that oriented graphs per- 
mit only three types of connection between two vertices, and so in particular there is no 
oriented graph corresponding to a complete graph. There is, however, an oriented graph 
corresponding to the empty graph, and it is this that gives rise to the upper bound. 

Theorem 6.2. An oriented graph G has a simple extension with at most m = [logg (| V (G) | + 1)] 
additional vertices. 

When the oriented graph forms a tournament, it is easy to check that only one addi- 
tional vertex is required — note in particular that the simple extension in Figure 9 is also 
an oriented graph. 

7. Relations with Higher Arity 

When a relational structure is defined on a language containing a fc-ary relation for some 
k > 3, simple extensions are often trivial to find and depend only on the type of relation. 
We begin with a straightforward observation. 

Lemma 7.1. A relational structure A on the language £ = {Ri, R2, . . . , mr^, n^^, . . .} is simple 
if there is some kfor which the structure ^r^, defined on the ground set dom(^R^) = dom(^) 
and language Cr^ = {Rjc/^Rt}' simple. 

Proof. Suppose, for a contradiction, that 7 is a nontrivial interval of A, and in partic- 
ular we may choose two distinct elements x,y S I. Since Ar^. is simple, there exists 
z G dom(^Rj, ) \ I = dom(^) \ I for which x and y are not interchangeable in any re- 
lationship of Rk containing z and at least one of x and y. This, however, is also true in 
A. □ 

Subsequently, if a relational structure A contains an arbitrary k-ary relation R with 
k > 3, only one additional element is required, regardless of what other relations the 
structure may hold: 
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Theorem 7.2. A relational structure A containing an arbitrary riR-ary relation R with ur > 3 
has a one-element simple extension. 

Proof. By Lemma 7.1 we need only prove that we may form a one-element simple exten- 
sion for a structure A defined on the language C = {R, consisting of only one relation. 
Furthermore, we assume that mr = 3, noting that the result is easily extended if our chosen 
arbitrary relation has higher arity. 

Choose any simple binary relational structure B on the set dom(^) with language 
{B, Wb = 2} — such a structure exists for every size of set by taking, for example, a path in 
the graph-theoretic sense unless | .4 1 < 3, in which case we may choose a simple tourna- 
ment. We now add an additional vertex xto A and define relations between x and dom(.A) 
to form A* as follows: for every {u,v) G B^, put {x,u,v) G R-^* . (Note that if > 3 we 
set (x, . . . , X, u, v) G R-^* .) We claim that A* is simple. By the simplicity of B, any interval 
containing more than one element of dom(^) must contain all of dom(^), so the only 
intervals we need to rule out are I = {w, x} for some iv G dom(^), and I = dom(^). 

For the first case, by the construction there exists at least one relationship of the form 
{x,w,v) or {x,v,iv) in R-^* for some v G dom(^) with v ^ w, and without loss we may 
assume the former. Since I is an interval, this implies that (x, x, v) G R^* , a contradiction 
as there are no relationships of the form (x, x, •) in R-^* . In the second case, I = dom(yl) 
implies that (x, u, v) G R'^* for every w, u G dom A, which implies that B can be thought 
of as a complete graph, a contradiction since B was chosen explicitly to be simple. □ 

Structures which have more restrictive relations do also sometimes have one point ex- 
tensions. For example, a suitably large relational structure containing a /c-ary relation for 
some fc > 3 that is irreflexive in its entries is easily found to have such an extension: 

Theorem 7.3. for k >3,a relational structure with n > k elements on a k-ary irreflexive relation 
has a one vertex simple extension. 

Proof. Fix k > 3, and let R be a fc-ary irreflexive relation. By Lemma 7.1 it suffices to form 
a one-element simple extension A* of a relational structure A on the language C = {R,k} 
whose ground set consists of n > k vertices. Set dom^* = dom^ U {x}, and define the 
set of relations R-^* as follows: (xi, ... ,Xk) G R-^* if and only if either (xi, . . . , Xj.) G R^ or 
X = X; for exactly one / and (xi, . . . , x,_i, x,+i, . . . , X]^) is a (fc — l)-tuple of distinct elements 
from dom(^). 

It follows routinely that A* is simple: suppose that 7 is a proper interval of A* con- 
taining M, u G dom(^*). By the construction there is at least one relation in R-^ of the 
form (m, z7, X3, . . . , Xk) where X3 ^ I, but this then means that (w, w, X3, . . . , x^) G R'^*, a 
contradiction since R is irreflexive. □ 

Note in particular that the above proof immediately answers the problem for hyper- 
graphs since the construction ensures that if ^ is a hypergraph then so is A* (i.e. R-^* is 
symmetric in its entries). 
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8. Concluding Remarks 

Tight bounds. Although the bounds in this paper are tight in the sense that there are 
certain structures which require as many points as the bound permits, on a typical exam- 
ple our constructions do quite a bit better. As noted in Section 3, more than two vertices 
are required in the graph case only if there is a large degenerate node in the substitution 
decomposition tree, all of whose children are leaves. Similar analysis of the substitution 
decomposition tree may be used in the cases of permutations, oriented graphs and di- 
graphs to determine exactly how many points will be required in any particular case. More 
complicated is the poset case: we cannot simply appeal to the permutation case because 
degenerate decompositions whose quotients are antichains can be treated as if they are 
empty graphs, which require only 0(log2 n) additional vertices. 

Average number of additional points required. In the case of graphs and any relational 
structure defined on a single asymmetric relation (e.g. tournaments, posets and oriented 
graphs), almost all structures are already simple (see Mohring [17]) and hence no points 
are required. The permutation case is more interesting, because here one expects to find 
two intervals of size two (see, for example, [3]). 

Simple extensions of infinite structures. Proposition 2.2 as proved by Erdos, Hajnal and 
Milner applies to tournaments of arbitrary cardinality, and it is natural to wonder what (if 
anything) can be said for other structures. Note that for many such structures it is easy 
to construct examples that require infinitely many additional points (e.g., the countably 
infinite complete graph). On the other hand. Theorem 7.2 holds so long as the structure 
contains an arbitrary WR-ary relation with Ur > 3. 
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